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Bell's theorem is a fundamental result in quantum mechanics: it discriminates between quantum 
mechanics and all theories where probabilities in measurement results arise from the ignorance of 
pre-existing local properties. We give an extremely simple proof of Bell's inequality: a single figure 
suffices. This simplicity may be useful in the unending debate of what exactly the Bell inequality 
means, since the hypothesis at the basis of the proof become extremely transparent. It is also a 
useful didactic tool, as the Bell inequality can be explained in a single intuitive lecture. 
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Introduction: — Einstein had a dream. He believed 
quantum meehanics was an incomplete description of re- 
ality iQ and that its completion might explain the trou- 
blesome fundamental probabilities of quantum mechanics 
as emerging from some hidden degrees of freedom: proba- 
bilities would arise because of our ignorance of these "hid- 
den variables" . His dream was that probabilities in quan- 
tum mechanics might turn out to have the same meaning 
as probabilities in classical thermodynamics, where they 
refer to our ignorance of the microscopic degrees of free- 
dom (e.g. the position and velocity of each gas molecule): 
he wrote, "the statistical quantum theory would, within 
the framework of future physics, take an approximately 
analogous position to the statistical mechanics within the 
framework of classical mechanics" [Q. 

A decade after Einstein's death, John Bell ^ shat- 
tered this dream in the worst possible way from Einstein's 
point of view Q ; any completion of quantum mechanics 
with hidden variables would be incompatible with rela- 
tivistic causality! The essence of Bell's theorem is that 
quantum mechanical probabilities cannot arise from the 
ignorance of local pre-existing variables. In other words, 
if we want to assign pre-existing (but hidden) properties 
to explain probabilities in quantum measurements, these 
properties must be non-local. This non-locality is of the 
worst possible kind: an agent with access to the non-local 
variables would be able to transmit information instantly 
to a distant location, thus violating relativistic causality 
and awakening the nastiest temporal paradoxes . 

Modern formulations of quantum mechanics must in- 
corporate Bell's result at their core: either they refuse 
the idea that measurements uncover pre-existing proper- 
ties, or they must make use of non-local properties. In 
the latter case, they must also introduce some censor- 
ship mechanism to prevent the use of hidden variables 
to transmit information. An example of the first formu- 
lation is the conventional "Copenhagen interpretation" 
of quantum mechanics, which (thanks to complementar- 
ity) states that the properties arise from the interaction 
between the quantum system and the measurement ap- 
paratus, they are not pre-existing: "unperformed exper- 
iments have no results" |6j. An example of the second 
formulation is the "de Broglie-Bohm interpretation" of 



quantum mechanics that assumes that particle trajecto- 
ries are hidden variables (they "exist" independently of 
position measurements) . 

Bell's result is at the core of modern quantum mechan- 
ics, as it elucidates the theory's precarious equilibrium 
with relativistic causality. It has spawned an impressive 
amount of research. However, it is often ignored in ba- 
sic quantum mechanics courses since traditional proofs 
of Bell's theorem are rather cumbersome and often over- 
burdened by philosophical considerations. Here we give 
an extremely simple graphical proof of Mermin's version 
10, 1^ of Bell's theorem. The simplicity of the proof is key 
to clarifying all the theorem's assumptions, the identifi- 
cation of which generated a large debate in the literature 
(e.g. see @). 

Bell's theorem: — Let us define "local" a theory where 
the outcomes of an experiment on a system are indepen- 
dent of the actions performed on a different system which 
has no causal connection with the first. For example, the 
temperature of my room is independent on whether you 
choose to wear a purple tie today. Einstein's relativity 
provides a stringent condition for causal connections: if 
two events are outside their respective light cones, there 
cannot be any causal connection among them. 
Let us define 



"counterfactual" |1C|, 11 a theory whose 
experiments uncover properties that are pre-existing. In 
other words, in a counterfactual theory it is meaningful 
to assign a property to a system (e.g. the position of an 
electron) independently of whether the measurement of 
such property is carried out. [Sometime this counterfac- 
tual definiteness property is also called "realism" , but it 
is best to avoid such philosophically laden term to avoid 
misconceptions .] 

Bell's theorem can be phrased as "quantum mechanics 
cannot be both local and counterfactual". A logically 
equivalent way of stating it is "quantum mechanics is 
either non-local or non-counterfactual" . 

To prove this theorem. Bell provided an inequality (re- 
ferring to correlations of measurement results) that is sat- 
isfied by all local and counterfactual theories. He then 
showed that quantum mechanics violates this inequality, 
and hence cannot be local and counterfactual. 

All experiments |12] performed to date have shown 
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that Bell inequalities are violated, suggesting that our 
world cannot be both local and countcrfactual. How- 
ever, it should be noted that no experiment up to now 
has been able to test Bell inequalities rigorously, because 
additional assumptions are required to take care of exper- 
imental imperfections. These assumptions are all quite 
reasonable, so that only conspiratorial alternatives to 
quantum mechanics (where experimental imperfections 
are fine-tuned to the properties of the objects jTsj) have 
yet to be ruled out. In the next couple of years the defini- 
tive Bell inequality experiment will be performed: many 
research groups worldwide are actively pursuing it. 

[If we want to be extremely pedantic in enumerating 
the hypothesis at the basis of Bell's theorem, we must 
also request (1) that our choice of which experiment 
to perform is independent of the properties of the ob- 
ject to be measured (technically, "freedom of choice" or 
"no super-determinism" Q): e.g., if we decided to mea- 
sure the color of red objects only, we would falsely con- 
clude that all objects are red; (2) that future outcomes 
of the experiment do not influence which apparatus set- 
tings were previously chosen (whereas clearly the ap- 
paratus settings will influence the outcomes): a trivial 
causality requirement (technically, "measurement inde- 
pendence" ) . These two hypothesis are usually left im- 
plicit because science would be impossible without them.] 

Proof of Bell's theorem: — We use the Bell inequal- 
ity proposed by Preskill |^, following Mermin's sugges- 
tion [0. Suppose we have two identical objects, namely 
they have the same properties. Suppose also that these 
properties are predetermined (countcrfactual definite- 
ness) and not generated by their measurement, and that 
the determination of the properties of one object will not 
influence any property of the other object (locality). 

We will only need three properties A, B, and C that 
can each take two values: "0" and "1". For example, if 
the objects are coins, then A = might mean that the 
coin is gold and A = 1 that the coin is copper (property 
A, material) , B — means the coin is shiny and B = 1 it 
is dull (property B, texture), and C = means the coin 
is large and C = 1 it is small (property C, size). 

Suppose I do not know the properties because the two 
coins are a gift in two wrapped boxes: I only know the gift 
is two identical coins, but I do not know whether they are 
two gold, shiny, small coins (A = 0, B = 0, C = 1) or two 
copper, shiny, large coins (1,0,0) or two gold, dull, large 
coins (1, 1,0), etc. I do know that the properties "exist" 
(namely, they are countcrfactual and predetermined even 
if I cannot see them directly) and they are local (namely, 
acting on one box will not change any property of the 
coin in the other box: the properties refer separately to 
each coin). These are quite reasonable assumptions for 
two coins! My ignorance of the properties is expressed 
through probabilities that represent either my expecta- 
tion of finding a property (Bayesian view), or the result of 
performing many repeated experiments with boxes and 



coins and averaging over some possibly hidden variable, 
typically indicated with the letter A Q], that determines 
the property (frequentist view) For example, I might 
say the gift bearer will give me two gold coins with a 20% 
probability (he is stingy, but not always). 

Bell's inequality refers to the correlation among mea- 
surement outcomes of the properties: call Psame{A, B) 
the probability that the properties A of the first ob- 
ject and B of the second are the same: A and B are 
both (the first coin is gold and the second is shiny) 
or they are both 1 (the first is copper and the second 
is dull). For example, Psame{A,B) — 1/2 tells me that 
with 50% chance A = B (namely they are both or both 
1). Since the two coins have equal counterfactual prop- 
erties, this also implies that with 50% chance I get two 
gold shiny coins or two copper dull coins. Note that the 
fact that the two coins have the same properties means 

that P,ameiA,A) = P.ameiB , B) = Psame{C,C) = 1: if 

one is made of gold, also the other one will be, or if one 
is made of copper, also the other one will be, etc. 

Bell's inequality J^: Under the conditions that three 
arbitrary two-valued properties A, i3, C satisfy counter- 
factual definiteness and locality, and that Psame{X^ X) = 
1 for X = A,B,C (i.e. the two objects have same proper- 
ties), the following inequality among correlations holds, 

same Psatne 

{A,C)+Psa,ne{B.C)^l , (1) 

namely, a Bell inequality. The proof of such inequality 
is given graphically in Fig. |l|. The inequality basically 
says that the sum of the probabilities that the two prop- 
erties are the same if I consider respectively A and B, 
A and C, and B and C must be larger than one. This 
is intuitively clear: since the two coins have the same 
properties, the sum of the probabilities that the coins 
are gold and shiny, copper and dull, gold and large, cop- 
per and small, shiny and small, dull and large is greater 
than one: all the combinations have been counted, pos- 
sibly more than once. [In Fig. || the events to which the 
probabilities represented by the Venn diagrams of Fig. |l| 
refer are made explicit.] 

This is true, of course, only if the two objects have 
same counterfactual properties and the measurement of 
one does not affect the outcome of the other. If we lack 
counterfactual properties, we cannot infer that the first 
coin is shiny only because we measured the second to 
be shiny, even if we know that the two coins have the 
same properties: without counterfactual definiteness, we 
cannot even speak of the first coin's texture unless we 
measure it. Moreover, if a measurement of the second 
coin's texture can change the one of the first coin (non- 
locality) again we cannot infer the first coin's texture 
from a measurement of the second: even if we know that 
the initial texture of the coins was the same, the mea- 
surement on the second may change such property of 
the first. The "counterfactual definiteness" hypothesis 
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P^au,e{A, B) + P i {A,B) - 1 




(a) 



Psam.iA, B) + P,a,ne{A, C) + Psa,ne{B , C) > 




and |ci) is orthogonal to |co). It is also easy to check that 




FIG. 1: Proof of Bell inequality (|l|) using areas to represent 
probabilities, (a) The dashed area represents the probabihty 
that property A of the first object and B of the second are 
equal (both 1 or both 0): PeameiA, B). The white area rep- 
resents the probability that they are different: Pdif f{A, B). 
The whole circle has area 1 — Psame{A, B) + Pdiff{A,B). 

(b) The gray area represents the probability that A and C 
are equal, and the non-gray area represents the probability 
that A and C are different. If A of the first object is different 
from both B and C of the second (dotted area) , then B and C 
of the second object must be the same. Hence, the probabil- 
ity that B and C are the same must be larger than (or equal 
to) the dotted area: since B is the same for the two objects, 
Psame {B , C) must be larger than (or equal to) the dotted area. 

(c) The quantity Psan^e{A,B) + P,ameiA,C) + Psame{B,C) is 
hence larger than (or equal to) the sum of the dashed -I- gray 
+ dotted areas, which is in turn larger than (or equal to) the 
full circle of area 1: this proves the Bell inequality (jl|). The 
reasoning fails if we do not employ counterfactual properties, 
for example if complementarity prevents us from assigning 
values to both properties B and C of the second object. It 
also fails if we employ non-local properties, for example if a 
measurement of B on an object to find its value changes the 
value of A of the other object. 



we used here can be relaxed somewhat, as shown in the 
appendix (suggested only to more advanced readers). 

To prove Bell's theorem, we now provide a quantum 
system that violates the above inequality. Consider two 
two-level systems (qubits) in the joint entangled state 
|$+) = (|00) + |11))/V2, and consider the two-valued 
properties A, B, and C obtained by projecting the qubit 
on the states 
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so that the two qubits have the same properties, namely 

Psame{A,A) = Psame{B,B) = Psame(CTC) = I- the 

measurement of the same property on both qubits always 
yields the same outcome, both or both 1. 

We are now ready to calculate the quantity on the left 
of Bell's inequality (|l|). Just write the state \ ^^) in terms 
of the eigenstates of the properties A, B and C. E.g., it 
is easy to find the value of Psame{A^ B) if we write 



1$+) = 



\ao){\ho) + y3|bi)) + |ai)(|V3|bo) - \b^)) 
2V2 



In fact, the probability of obtaining zero for both prop- 
erties is the square modulus of the coefficient of |ao)|6o)i 



namely |l/2\/2p 



1/ 



while the probability of obtain- 



ing one for both is the square modulus of the coefficient 
of |ai)|6i), again 1/8. Hence, Psame{A,B) = 1/8 + 1/8 = 
1/4. Analogously, we find that Psame{ATC) = 1/4 and 
that Psame{B, C) = 1/4 by expressing the state as 

_ |ao)(|co)+V3|ci))-|ai)(|x/3|co)-|ci)) 
\^ + \ _ (|bo) + 73|fci))(|co) + 73|ci))-(v/3|bo)-|fci))(73|co)-|ci)) 



Summarizing, we have found 

Psame{A, B) + Psame{A, C) -\~ Psame{B , C) 



(4) 



which violates Bell's inequality (|i|). 

This proves Bell's theorem: all local counterfactual 
theories must satisfy inequality (^ which is violated by 
quantum mechanics. Then, quantum mechanics cannot 
be a local counterfactual theory: it must either be non- 
counterfactual (as in the Copenhagen interpretation) or 
non-local (as in the de Broglie-Bohm interpretation). 
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FIG. 2: Explicit depiction of the properties whose proba- 
bilities are represented by the areas of the Venn diagrams in 
Fig. 0. The properties are represented by a triplet of num- 
bers (A, B, C) that indicate the (counterfactual, local) values 
of the properties A, B, and C for both objects. Note that in 
the dotted area A must be different from both B and C, so 
that B and C must be equal there {B and C are equal also 
in the intersection between the two smaller sets, but that is 
irrelevant to the proof). 



the spirit of the original proof of BcU's theorem [Q, ^ , 
one can relax the "counterfactual definiteness" hypothe- 
sis somewhat. In fact, instead of supposing that there are 
some pre-existing properties of the objects (counterfac- 
tual definiteness) , we can suppose that the properties are 
not completely pre-determined, but that a hidden vari- 
able A exists and the properties have a probability dis- 
tribution that is a function of A. The "hidden variable 
model" hypothesis is weaker than counterfactual definite- 
ness: if the properties are pre-existing, then their prob- 
ability distribution in A is trivial: there is a value of A 
that determines uniquely the property, e.g. a value Aq 
such that the probability Pi{a = 0|^, Aq) = 1 and hence 
Pi{a = 1|^, Ao) = 0, namely it is certain that property A 
for object i has value a = for A = Aq. 

Following p5| ] , we now show that a local, hidden vari- 
able model together with the request that the two sys- 
tems can have identical properties, implies counterfactual 
definiteness. This means that we can replace "counter- 
factual definiteness" with "hidden variable model" in the 
proof of Bell theorem, which, with these relaxed hypoth- 
esis states that "no local hidden variable model can rep- 
resent quantum mechanics" . 

Call P{x, x'\X, X' , A) the probability distribution (due 
to the hidden variable model) that the measurement of 
the property X on the first object gives result x and 
the measurement of X' on the second gives x' , where 
X,X' = A,B,C denote the three two-valued properties 
A, B, and C. The locality means that the probability dis- 
tributions of the properties of the two objects factorize, 
namely P{x, x'\X, X' , X) = Pi{x\X, X)P2ix'\X' , X): the 
factorization of the probability means that the probabil- 
ity of seeing some value x of the property X for object 
1 is independent of which property X' one chooses to 
measure and what result x' one obtains on object 2 (and 
viceversa) . 

If two objects have the same property, then 
PsameiXjX) = 1, namely the probability that a mea- 
surement of the same property X on the two objects 
gives opposite results (say, x = I and x' = 0) is null. 
In formulas, 



P{x = \,x' = OjX, X, A) ]9(A) = 



(5) 



where the "Ylix emphasizes that we are averaging over the 
hidden variables (since they are hidden): p(A) is the prob- 
ability distribution of the hidden variable A in the initial 
(joint) state of the two systems. Note that in Eq. (|^) 
we are measuring the same property X on both objects 
but we are looking for the probability of obtaining op- 
posite results x' ^ x. As argued above, locality implies 
factorization of the probability, namely Eq. (p[) becomes 



APPENDIX: Hidden variable models: — This ap- 
pendix is addressed only to more advanced readers. In 



Pi{x = 1\X, A) P2ix' = 0\X, A) piX) = . (6) 

A 
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Since Pi, P2, and p are probabilities, they must be pos- 
itive. Consider the values of A for which p{\) > 0: the 
above sum can be null only if either Pi or P2 is null. 
Namely if Pi (a; = A) = (which implies that X has 
the predetermined value a; = 0) or if P2(x' = 0|X, A) = 
(which means that X has predetermined value x' = 1): 
we remind that counterfactual definiteness means that 
Pi{x\X, A) is either or 1: it is equal to if the property 
X of the ith object does not have the value x, and it is 
equal to 1 if it does have the value x. We have, hence, 
shown that Eq. (^ implies counterfactual definiteness for 
property X: its value is predetermined for one of the two 
objects. 

Summarizing, if we assume that a local hidden variable 
model admits two objects that have the same values of 



their properties, then we can prove counterfactual defi- 
niteness. This means that we can relax the "counterfac- 
tual definiteness" hypothesis in the proof of the Bell theo- 
rem, replacing it with the "existence of a hidden variable 
model" , so that the Bell theorem takes the meaning that 
"no local hidden variable model can describe quantum 
mechanics" [the hypothesis that two objects can have the 
same values for the properties is implicit in the fact that 
such objects exist in quantum mechanics, see Eq. (||)]. 
Namely, if we want to use a hidden variable model to 
describe quantum mechanics (as in the de Broglie-Bohm 
interpretation), such model must be non-local. Other- 
wise we cannot use a hidden variable model (as in the 
Copenhagen interpretation) . 



